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We discuss the feasibility of seeing a Higgs boson which decays to four partons through a pair of 
(pseudo-)scalars at the LHC. We restrict our search to Higgs bosons produced in association with 
a W/Z boson at high transverse momentum. We argue that subjet analysis techniques are a good 
discriminant between such events and W/Z plus jets and tt production. For light scalar masses 
(below 30 GeV), we find evidence that a flavor-independent search for such a non-standard Higgs 
boson is plausible with 100 fb _1 of data, while a Higgs decaying to heavier scalars is only likely to 
be visible in models where scalar decays to b quarks dominate. 
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I. INTRODUCTION 

Significant effort is being made to discover the Higgs 
boson at the Large Hadron Collider. If the Standard 
Model (SM) Higgs boson is ruled out in all of its allowed 
mass range, the next step will be to find out if a Higgs (a 
remnant of weakly coupled electroweak symmetry break- 
ing) exists at all. In doing so, one must consider all 
possible reasonable decay modes due to new physics. 

The phenomenology of a SM Higgs boson with mass 
77i/i < 135 GeV is dominated by the relatively weak 
Yukawa couplings characterized by the ratio of light 
fermion masses and the vacuum expectation value (vev) 
of the Higgs. Because the Higgs vev (~ 246 GeV) is so 
much larger than the mass of any of the fermions to which 
the Higgs boson can decay, given restriction on m/j from 
precision electroweak tests [l(, these couplings are quite 
small, and therefore the width of the light Higgs boson 
into SM particles is quite low (~ 3 MeV for rrih = 120 
GeV). With such a small decay width, new light parti- 
cles, even with a relatively small couplings to the Higgs, 
could easily be its dominate decay product. Under any 
such model, SM Higgs searches could be rendered useless. 

The simplest possible extension to the Higgs sector in- 
volves the addition of a gauge singlet scalar, a. Var- 
ious new physics models contain such a field, such as 
non-minimal supersymmetry and composite (and Little) 
Higgs models (for a review, see [2J). Given a coupling be- 
tween the Higgs field and the a field A a H 2 a 2 electroweak 
symmetry breaking allows the decay h — > eta. With- 
out any additional terms in the Lagrangian, the a would 
be stable and experimentalists would need to search for 
an invisible Higgs - such a phenomenological possibility 
has been studied in, e.g. Q. Other reasonable possibili- 
ties include a model in which the a and the Higgs boson 
mix, in which case the a picks up SM Higgs branching 
ratios to SM particles. For most of the reasonable pa- 
rameter space, this would lead to the dominant decay 
chain h — > 2a — > Ab. Searches for this final state have 
been performed with LEP data and a lower bound of 110 
GeV has been set Q. Other simple and natural models, 
where a is a pseudo-Goldstone boson Q, can easily al- 
low a — > gg (gluons) to dominate (see for example [6J|) or 
lighter quarks (see 0)- 



Previous work has been done on the phenomenology 
of h -> Ab at the LHC @,|. The backgrounds for such 
signals are large, as are those for the SM h —> bb. The 
SM case was approached more recently by looking at the 
phase space where the Higgs is produced in association 
with a vector boson, and the two particles are produced 
with large pr (lOj . Not only are leptonic decays of the 
gauge boson a nice discriminant from QCD production, 
the SM background (W and Z plus jets) drops much 
faster than the signal at large px due to the fact that the 
background is dominated by t-channel diagrams while 
the signal is dominated by the s-channel. Their analysis 
used jet deconstruction and reconstruction tools partly 
developed in early work [ll[ ■ The analysis of the boosted 
Higgs [10| showed that a once hopeless signal became 
hopeful. 

In this paper, we discuss the application of boosted 
Higgses analyses to non-standard Higgs decays can have 
a dramatic impact on the reach of the LHC, not only in 
the h — > Ab channel, but in the notorious h — » Ag chan- 
nel as well. Vector boson plus jets and tt production will 
provide significant backgrounds to our signal, but we find 
that discovery significance can be achieved in both chan- 
nels for significant ranges of a masses with more than 100 
fb _1 of data. Work along these lines with much smaller 
ranges of parameters studied [l2l [l3j , or with significant 
model dependence [1J|, have appeared previously. This 
work had previously appeared in a PhD. thesis [15j . 



II. ANALYSIS AND RESULTS 

We generate both signals and backgrounds using the 
MadGraph/MadEvent v4.3 package Tl6l [l7| ■ These 
events are showered through Pythia [18| and jet clus- 
tering is performed using the routines in the Fast Jet [l9j 
libraries. 

Vector bosons which decay hadronically are difficult to 
distinguish from hadronic activity arising through QCD, 
so we proceed based on an analysis of leptonic decays of 
vector bosons. These come in three types for our signal: 
Zh i + £-h, Zh $rh an d W ± h -> £ ± ^ T h, where 
represents a charged light lepton (muon or electron). 
In order to simulate lepton isolation cuts applied experi- 
mentally, we neglect from our analysis leptons which have 
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TABLE I. Cross-sections in picobarns after successive cuts 
on leptons and IpT for associated Higgs production and back- 
grounds. The "Inclusive" column gives the production cross- 
section with no cuts. The "Vpr" column gives the production 
cross-section for events in which there is a vector boson (W 
or Z) with pr > 200 GeV (including all decay modes for this 
boson). These cross-sections are presented to leading order. 



Process 


Inclusive 


V pr 


i + r 


t + $T 


#r 


Wh 


0.87 


0.047 


< 0.001 


0.011 


< 0.001 


Zh 


0.74 


0.038 


0.0018 


< 0.001 


0.0063 


W + jets 


28600 


180 


< 0.1 


30.8 


1.8 


Z + jets 


9300 


80.6 


3.0 


0.5 


15.5 


ti 


610 


54.3 


< 0.1 


8.4 


2.2 



pr < 10 GeV, or for which there is significant hadronic 
activity in an isolation cone of size AR = 0.4 around the 
lepton (significant meaning a scalar sum of px more than 
20% of the px of the lepton itself) . Leptons vetoed in this 
way are included in the hadronic activity of the event. 

Events are then classified into one of three channels; 
the first requires the presence of two opposite-charge, 
same flavor leptons which reconstruct to an invariant 
mass of 91 ± 10 GeV with total p T > 200 GeV. Fail- 
ing this, events with a lepton and missing Et which re- 
construct to an invariant transverse mass of at most 90 
GeV (10 GeV greater than the W mass) with a total 
Pt > 200 GeV are accepted into the second channel. 
Finally, events which simply have IpT > 200 GeV are 
accepted via the third channel. 

The energy resolution on electrons and muons at AT- 
LAS and CMS is expected to be of the order of a few 
percent, while the resolution on missing Et is expected 
to be < 10 GeV for events in which the scalar sum of px is 
< 500 GeV. We neglect the energy and angular resolution 
on isolated light leptons. The production cross-sections 
pp -> Vh, pp -> V + jets, and pp -> it W + bW~b 
are relatively flat as a function of the pr of the outgoing 
vector boson (~ 10% difference from 190 — 200 GeV, and 
from 200 — 210 GeV). The effect of smearing is to change 
the number of events that pass the cuts by at most a few 
percent, and thus we ignore the missing Et smearing. 

As can be seen from Table U the efficiency with which 
signal events (in which the boson decays leptonically or 
invisibly and has sufficient pr) pass these cuts is ~ 75%, 
and most of the passed events are characterized properly 
(e.g. events with a W boson pass the lepton + missing 
Et cut). However, a significant minority of ti pass the 
pure ]pr£ cut instead of I + $t, due to the charged lepton 
being non-central or too close to a hadronic jet. 

Once the cuts on reconstructed bosons have been ap- 
plied, we proceed to an analysis of the remaining outgoing 
energy (we will refer to this as the "hadronic" side of the 
event, though it includes outgoing particles which will be 
captured by the EM and hadronic calorimeters as well as 



the muon chambers). The calorimeters at the LHC have 
finite angular granularity on the scale of A<fi ~ Arj < 0.1 
in most of the acceptance region [2(| ■ As we will be 
doing an analysis of the substructure of jets on a scale 
not too much larger than this, we simulate it by binning 
hadronic energy into 63 equally-sized bins in azimuthal 
angle 4> and into 100 equally-sized bins in pseudorapidity 
(for —5 < r\ < 5. We do not simulate the finite energy res- 
olution of the EM and Hadron calorimeters themselves, 
but we expect this effect to be sub-dominant to the ef- 
fects of the jet algorithms and the finite granularity. The 
energy resolution is expected to be of the order of 10% 
for the relevant energy scales [2l| while the cuts we will 
perform assume resolutions somewhat worse than this. 

Once this binning has been accomplished the resulting 
"particles" (representing energy deposits in the calorime- 
ter) are clustered according the Cambridge- Aachen jet al- 
gorithm with a merging cutoff parameter of R m ax = 1.2 
[22J. This value of R ma x was optimized empirically for a 
light Higgs mass mh = 120 GeV in order to capture the 
decay products of the Higgs boson in a single jet. Jets 
are only kept if the clustered jet has pt > 30 GeV. 

The Cambridge-Aachen jet algorithm has a useful 
property; as it is a monotonic iterative procedure (at 
every step objects can only be merged, never unmerged) 
it is possible to work backwards from a single final jet to 
study its component subjets. The method by which we 
accomplish this is similar to the that used in the study 
of boosted top quarks in (23[. The candidate Higgs jet 
(taken to be the highest pt jet in each event) is itera- 
tively decomposed. At every iteration every (sub)jet is 
examined to determine if it can be broken down further 
according to two criteria: that the two components are 
separated by AR > R m i n and each has pt > (p T where 
p T is the transverse momentum of the original Higgs can- 
didate jet and R m in and £ are two input parameters, set 
to 0.31 and 0.2 respectively in this analysis (R m in is cho- 
sen so that it is not an exact multiple of cell spacing). 
The results we obtain are not particularly sensitive to 
these choices for a range of parameters 0.2 < R m in < 0.5 
and 0.15 < ( < 0.3. 

The examination of every (sub)jet is itself iterative; if 
the two immediate parents of a (sub)jet do not pass the 
above criteria then the leading parent (in pt) is itself 
examined to determine if it can be broken in two, and 
so forth, while the softer parent is removed from con- 
sideration. If this procedure reaches the level of primi- 
tive objects (where the objects have no parents) then the 
original (sub)jet being examined is accepted as being in- 
divisible. If, on the other hand, the procedure eventually 
finds two acceptable subjets then the original (sub)jet 
is broken into these two objects. This can mean that 
some of the original (sub)jet's energy being lost as re- 
jected parents are removed from the final objects - we 
describe this removal as "cleaning" t he j et, (essentially 
identical to "pruning", introduced in |24f). Cleaning af- 
fects relatively soft objects well-separated (in AR) from 
most of the energy in the (sub)jet. In Figure[TJwe present 
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FIG. 1. Cross-sections (per 5 GeV bin) versus the recon- 
structed Higgs mass. Cleaned masses have been broken into 
subjets and reconstructed from these. In the plot, "light 
scalars" have a mass of 20 GeV and "heavy scalars", 50 GeV. 
The Higgs mass used to produce the signal was 120 GeV. 
Signals and backgrounds represent the sum of the three re- 
constructed vector boson channels. The data presented are 
for the h — >■ aa — > bbbb model. 



the difference in reconstructed Higgs mass between the 
uncleaned mass reconstructed before breaking the Higgs 
jet into subjets and the cleaned mass reconstructed from 
subjets. If the candidate Higgs does not split into two, 
three or four subjets via the procedure outlined above, 
then the event is rejected, as our goal is to use jet sub- 
structure as a discriminant. 

The phenomenology of h — > 2a — > 4j can be thought 
of as splitting into two disparate regimes: one where the 
intermediate scalar is relatively light (m a < 30 GeV) 
and one where it's relatively heavy. In the former case, 
the intermediate scalars can both be reconstructed fairly 
easily with high efficiency, while in the latter such a full 
reconstruction of the two stage decay isn't possible. We 
therefore proceed with two parallel analyses. 



A. Light Scalars 

Figure [JJ demonstrates that the mass resolution on the 
reconstructed Higgs is relatively unaffected, for signal 
events with a light intermediate scalar, by the choice of 
cleaned versus uncleaned mass. On the other hand, the 
background is reduced by ~ 20% when using the cleaned 
mass. Since the analysis detailed in this subsection won't 
provide a statistically-significant signal at the LHC given 
a heavy intermediate scalar, we optimize for light scalars 
by using the cleaned Higgs mass as our first discriminant. 

We require the search to be performed with a specific 
Higgs mass in mind, and then scan over possible Higgs 
masses. An appropriate search would be for Higgs masses 
between 100 and 150 GeV - the bottom of the range is 



beyond a reasonable guess of LEP sensitivity to Higgs 
decays to four gluons (25[, while the top is set by the 
mass where Higgs to gauge bosons will dominate over 
a weakly coupled scalar. Fine-tuning considerations for 
the light scalar brings the upper limit of this range to 
130 GeV. A scan over Higgs masses can be relatively 
course (every 5 GeV) considering the energy resolutions 
allowed by detectors and the jet finding algorithm. As an 
example, we choose a Higgs mass of 120 GeV (both for 
the Monte Carlo sample and for the simulated search), 
for which we accept events with a reconstructed mass 
between 100 and 125 GeV. 

If the candidate Higgs jet splits into two subjets then 
each of these is assumed to arise from a single intermedi- 
ate scalar. The invariant mass of each of the two jets are 
compared, and the lighter jet is required to be at least 
75% the mass of the heavier jet. If the candidate Higgs 
splits into three subjets then the four-momenta of the 
two lightest subjets are combined and the invariant mass 
of this object is compared to the mass of the heaviest jet. 
Again, the lighter candidate scalar is required to have 
75% of the mass of the heavier candidate. Finally, if the 
candidate Higgs splits into four subjets, then each of the 
three possible pairings of jets into candidate scalars is at- 
tempted and the combination with the closest match in 
mass is assumed to be correct. The same 75% criterion 
is applied in this case. 

In Table [Til we present the effects of the cuts. First 
we present the affect on the effective cross section after 
the combination of the Higgs invariant mass cut and the 
> 2 subjets cut is applied. Second, we present the result 
of the subsequent cuts on the invariant masses of the 
intermediate scalars. The results are shown for each of 
the three samples - dileptons, single lepton with missing 
transverse energy, and only missing transverse energy. 

The mean of the masses of the two candidate scalar 
objects is then used as our final discriminant. As can 
be seen from Figure [5] a mass window of ~ 20 — 30% 
of the central value is reasonably optimal to maximize 
the expected significance of the signal. In Table IIII1 we 
present the expected number of events for various sig- 
nal cases as well as from the major backgrounds falling 
in the peak mass window for each signal case. We also 
show the expected statistical significance calculated sim- 
ply as signal/ -^/background. The results are presented as 
the sum across all three channels, (i), (ii), and (in). All 
of our cuts in this analysis are identical across the three 
channels, and none of the channels appears to be signif- 
icantly favored relative to any other channel by the cuts 
applied on the candidate Higgs jet. We obtain ~ 3.5er 
for m a < 30 GeV. Above this threshold it becomes more 
and more unlikely to properly reconstruct the intermedi- 
ate scalars through the technique above, and the drop-off 
in expected significance is severe. We present results for 
a model with a branching ratio of 1 for a — > bb as well 
as a model with a branching ratio of 1 for a —¥ gg. The 
results are broadly similar, with a — > gg giving somewhat 
better reconstruction efficiencies. This suggests that the 
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TABLE II. Cross-sections (in femtobarns) after various cuts 
for associated Higgs production and backgrounds. Signal 
events with the decays h — > 46 and h — > 4g are presented 
with a range of a masses (in GeV). The m;, columns are after 
a cut requiring that the candidate Higgs jet splits into at least 
two subjets and has cleaned mass between 100 and 125 GeV. 
The Am a columns are after a cut requiring that the recon- 
structed a masses are within 25% of each other. The three 
pairs of columns are for, respectively, i) the dilepton sample, 
ii) the £ + Ifrr sample, and Hi) the pure Ipr sample. 



process 


m h 


Am a 


m h 


Am a 


m h 


Am a 


V + j 


98.2 


24.3 


990.8 


244.7 


511.1 


126.3 


ti 


0.2 


< 0.1 


685.9 


164.7 


154.3 


37.2 


46(15) 


0.73 


0.49 


4.59 


3.08 


2.74 


1.85 


46(20) 


0.77 


0.54 


4.83 


3.33 


2.83 


1.97 


46(30) 


0.68 


0.36 


4.28 


2.21 


2.52 


1.31 


46(40) 


0.38 


0.06 


2.48 


0.37 


1.47 


0.20 


40(15) 


0.76 


0.56 


4.50 


3.32 


2.79 


1.99 


4<?(20) 


0.80 


0.61 


4.78 


3.65 


2.85 


2.17 


4ff(30) 


0.81 


0.46 


4.80 


2.65 


2.82 


1.56 


4<?(40) 


0.52 


0.12 


3.13 


0.60 


1.86 


0.35 
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FIG. 2. Expected events (per 1 GeV bin) versus the recon- 
structed intermediate scalar mass. Two different signal cases 
are presented, with m a = 20 GeV and m a = 30 GeV. Both 
sets of points represent a model where a — > gg is the only 
decay channel. A model in which a — > 66 is dominant would 
give ~ 10% less signal for the m a = 20 GeV case and ~ 25% 
less signal for the m a — 30 GeV case. 



preceding type of analysis provides significant discovery 
potential at the LHC for a large range of intermediate 
scalar masses and any model in which the intermediate 
scalar preferentially decays into pairs of colored objects 
(even if the decay width of the intermediate scalar is 
shared between several channels). Such model indepen- 
dence is a powerful reason to prefer this analysis over, for 
example, an analysis which depends on the flavor-tagging 
of decay products. 



TABLE III. Expected number of events (with 100 ftr 1 of 
data) lying under the peak signal reconstructed mass window 
(in GeV) for various intermediate scalar masses (both for h — >• 
46 and h — > 4g), along with statistical significance a — s/\/b. 
Note the precipitous drop-off in signal above m a ~ 30 GeV. 



Process 


Window 


Signal 


V + j 


ti 


s/b 


a 


46(15) 


12 - 17 


479 


10780 


6320 


2.8% 


3.7 


46(20) 


16 - 22 


536 


16500 


7310 


2.2% 


3.5 


46(30) 


25-31 


317 


5810 


2800 


3.7% 


3.4 


46(40) 


32 - 40 


34 


1130 


670 


1.9% 


0.8 


4fl(15) 


12 - 17 


523 


10780 


6320 


3.1% 


4.0 


4ff(20) 


16 - 22 


608 


16500 


7310 


2.5% 


3.9 


4ff(30) 


25-31 


420 


5810 


2800 


4.9% 


4.5 


4ff(40) 


32 - 40 


65 


1130 


670 


3.6% 


1.5 



The data presented in Figure [2] represent only part 
of the information which would be available to an ex- 
perimentalist pursuing a similar analysis. In order to 
understand experimental backgrounds more fully than is 
possible merely through Monte Carlo simulation, there 
are a number of physical handles which can provide in- 
formation about the individual processes which make up 
the background. For example, the l + l~ channel (i) has 
almost no ti component, while the ti background sample 
can be made more pure by making an extra jet require- 
ment. A further possibility is to look at 7 + jets events. 
This is a purely SM background sample; the topology 
of the diagrams which contribute to such production is 
the same as for those which contribute to the V + jets 
background. Only the mass of the vector boson and a 
single coupling constant differ. Analysis of the hadronic 
components of 7 + jets (or Drcll-Yan plus jet outside of 
the i?-mass window) events could yield substantial un- 
derstanding of the hadronic components of V + jets. 

While we have presented a search strategy for hadron- 
ically decaying a scalars, the dominant a decay in some 
models is to 66. For this case, we can ask how the 
prospects for the search are affected by requiring one 
or two b tags on the Higgs component subjets. The 
6-tagging probabilities we use are estimated using the 
method described in the following subsection. Table |IV] 
shows the effect of requiring b tags on the number of ex- 
pected signal and background events in the appropriate 
peak signal region given in Table IIIII In Figure [3] we re- 
produce the equivalent of Figure [5] for the a — > bb model 
when requiring that at least one subjet receives a b tag. 



B. Heavy Scalars 

As seen above, we cannot rely on reconstructing the 
intermediate scalars properly when m a is much above 30 
GeV. In this section we will discuss the discovery poten- 
tial for the specific Higgs cascade decay h — >• aa —> bbbb. 
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TABLE IV. Expected number of events (with 100 ft)" 1 of 
data) requiring 1 and 2 b tags (first and set set of columns 
respectfully) for various values of m a in GeV. We assume a 
branching ratio of 1 for h — > 46. Signals and backgrounds 
for each m a are those lying under the mass window given in 
Table Mil Background represents the sum of the V + jets 
background and tt background. Also provided is a — -j=. 



m a 


signal 


background 


a 


signal 


background 


a 


15 


414 


3150 


7.4 


191 


122 


17.3 


20 


433 


3600 


7.2 


167 


130 


14.7 


30 


215 


1090 


6.5 


64 


39 


10.3 


40 


21 


230 


1.4 


7 


11 


2.2 
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FIG. 3. Expected events (per 1 GeV bin) versus the recon- 
structed intermediate scalar mass. Two different signal cases 
are presented, with m a = 20 GeV and m a — 30 GeV. Both 
sets of points represent a model where a — > bb is the only 
decay channel. We require that at least one of the candidate 
Higgs subjets receives a b tag. 



The additional discriminant provided by flavor-tagging is 
necessary to provide statistical significance of the signal 
with a heavier intermediate scalar. 

As noted previously, for heavier scalars the uncleaned 
reconstructed Higgs mass is much better resolved than 
the cleaned Higgs mass, so we make our first cut on the 
uncleaned mass, passing events in the window 100 — 125 
GeV. We will be applying a 6-tagging requirement, so the 
£ + Jpr channel (labeled (ii) earlier) will be dominated 
by tt (since virtually every t decay includes a b while 
only a minority of QCD jets in V + jets are 6s). The tt 
events have significantly more hadronic activity than do 
signal or V + jets events. We therefore apply a jet veto 
- all events with at least one additional pr > 30 GeV 
jet are rejected - to reduce the dominant tt background. 
Finally, the fact that this analysis is aimed at heavier 
intermediate scalars allows us to require the candidate 
Higgs jet to split into 3+ subjets. The effect of these two 
cuts is summarized in Table IV] 



TABLE V. Cross-sections (in fb) after a cut on the uncleaned 
reconstructed Higgs mass and a jet veto for associated Higgs 
production and backgrounds. The three sets of columns are 
for (i) the dilepton channel, (ii) the £ + Ifir channel and (Hi) 
the pure J?t channel. Note, a jet veto is not necessary for 
channel (i). The final column is the sum over all channels 
after requiring at least 3 subjets from the candidate Higgs. 



process 


m h 


m h 


Veto 


m h 


Veto 


sub jet 


V + j 


127.2 


1282 


725.3 


658.5 


365.2 


138.6 


tt 


0.3 


833.3 


92.6 


188.3 


11.4 


7.86 


Vh(15) 


0.72 


4.47 


3.21 


2.68 


1.92 


0.05 


Vh(20) 


0.76 


4.70 


3.40 


2.77 


1.97 


0.09 


Vh(30) 


0.71 


4.39 


3.13 


2.59 


1.82 


1.14 


Vh(40) 


0.68 


4.35 


3.15 


2.54 


1.80 


1.81 


Vh(50) 


0.70 


4.50 


3.26 


2.66 


1.90 


1.78 



V+jets [ 
top bair 
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FIG. 4. Expected events (per 5 GeV bin) versus the recon- 
structed Higgs mass given 100 fb" 1 of data. Two different 
signal cases are presented, with m a — 30 GeV and m a = 50 
GeV. Both sets of points represent a model where a — > bb 
is the only decay channel. The data represents events which 
make it through all cuts discussed in this section except the 
cut on mh (jet veto, > 3 subjets, and > 2 6-tagged subjets). 



With these kinematic cuts in place, we turn our atten- 
tion to 6-tagging of the constituent subjets. Our model of 
tagging efficiencies and purities is relatively simple: dis- 
placed vertices arising from b and c hadrons (culled from 
Monte Carlo level information) are assigned to the near- 
est (in the AR measure) subjet, as long as that subjet is 
less than a fixed AR max (set here to 0.2) from the ver- 
tex. Subjets with assigned vertices are tagged with fixed 
efficiencies (eb,e c ) given b and c vertices, respectively. If 
a single subjet is assigned multiple vertices then each 
vertex provides an independent tagging probability (e.g. 
the tagging probability of a subjet assigned two b ver- 
tices is 1 — (1 — e/j) 2 ). Similarly, the tagging probability 
on different subjets is assumed to be independent. If a 
subjet is not assigned any vertices then it may still be 
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TABLE VI. Expected number of signal events (given 100 fb -1 
of data) after b-tagging and all cuts, along with expected sig- 
nal to background ratio and significance (as before). 



m a (GeV) 


Signal 


s/b 


a 


20 


1.9 


2% 


0.18 


30 


37.4 


33% 


3.49 


40 


63.1 


55% 


5.89 


50 


61.0 


53% 


5.69 



tagged with efficiency e;. We use the values = 0.4, 
e c = 0.1, £; = 0.02. We require that at least two subjets 
are 6-tagged. With these efficiencies and requirements, 
the V + jets background cross-section is reduced from 
138.6 fb to 0.93 fb and the tt background from 7.86 fb 
to 0.21 fb. The impact of 6-tagging on our signal is sum- 
marized in Table [VTl We also present, in Figure [U the 
reconstructed unclcancd Higgs mass (as in Figure [1} for 
signal and backgrounds after three cuts are taken into ac- 
count: the jet veto, the splitting of the candidate Higgs 
into 3+ subjets and the 6-tagging 2+ subjets. 



III. CONCLUSION 



By restricting our search to a very specific part of phase 
space (high px associated Higgs production), it is pos- 
sible to strongly reduce the SM background while only 
weakly reducing the Higgs signal. The high px threshold 
on the vector boson leads to a signal sample of high pt 
Higgs bosons, which means that the decay products of the 
Higgs, under any physics model, tend to be collimated in 
a relatively small region in the detector. Using analysis of 
the subjet structure present due to cascade Higgs decays 
through a light scalar we've demonstrated the feasibil- 
ity of a flavor-independent search for scalar masses from 
-15-30 GeV with - 100 fb" 1 . This, for example, is the 
broadest parameter space suggested by any search strat- 
egy thus far presented for the decay h — > 2a —> 4g and 
thus may be a viable search at the LHC. If cascade Higgs 
decays proceed primarily to 6 quarks then flavor tagging 
can increase the significance of the search dramatically. 
For the high range of scalar masses, 35 GeV < m a < ^ 
the options are more limited; assuming reasonable 6 tag- 
ging efficiency and purity, discovery significance can also 
be achieved with ~ 100 fb -1 . 
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